Development of a data-driven scientific methodology: From articles to chemometric data products
Abstract
Information and data science algorithms were combined to predict the outcome of an experiment in chemical engineering. Using the Scientific Method workflow, we started the journey with the formulation of a specific question. At the research stage, the common process of querying and reading articles on scientific databases was substituted by a systematic review with a built-in recursive data mining method. This procedure identifies the community knowledge, key concepts, and experiments that are necessary to address the formulated question. A small subset of relevant articles from a very specific topic among thousands of papers was identified while assuring the loss of the least amount of information through the process. The secondary dataset was bigger than that of an individual study. The process revealed the main ideas currently under study and the optimal synthesis conditions to produce a chemical substance. Once the research step was finished, the experimental data were compiled and prepared for meta-analysis using a supervised learning algorithm. This is the hypothesis generation stage, whereby the secondary dataset was transformed into knowledge about a particular chemical reaction. Finally, the predicted sets of optimal conditions to produce the desired chemical compound were validated in the laboratory.
Similar resources
Data Driven Techniques for Organizing Scientific Articles Relevant to Biomimicry
Life on earth presents elegant solutions to many of the challenges innovators and entrepreneurs across disciplines face every day. To facilitate innovations inspired by nature, there is an emerging need for systems that bring relevant biological information to this application-oriented market. In this paper, we discuss our approach to assembling a system that uses machine learning techniques to...
Enhancing Learning from Imbalanced Classes via Data Preprocessing: A Data-Driven Application in Metabolomics Data Mining
This paper presents a data mining application in metabolomics. It aims at building an enhanced machine learning classifier that can be used for diagnosing cachexia syndrome and identifying its involved biomarkers. To achieve this goal, a data-driven analysis is carried out using a public dataset consisting of 1H-NMR metabolite profile. This dataset suffers from the problem of imbalanced classes...
Statistical Data Editing in Scientific Articles
Scientific journals are important scholarly forums for sharing research findings. Editors have important roles in safeguarding standards of scientific publication and should be familiar with correct presentation of results, among other core competencies. Editors do not have access to the raw data and should thus rely on clues in the submitted manuscripts. To identify probable errors, they shoul...
Data-Driven Approaches to Improve the Quality of Clinical Processes: A Systematic Review
Background: Considering the emergence of electronic health records and their related technologies, an increasing attention is paid to data driven approaches like machine learning, data mining, and process mining. The aim of this paper was to identify and classify these approaches to enhance the quality of clinical processes. Methods: In order to determine the knowledge related to the research ...
KUL: Data-driven Approach to Temporal Parsing of Newswire Articles
This paper describes a system for temporal processing of text, which participated in the Temporal Evaluations 2013 campaign. The system employs a number of machine learning classifiers to perform the core tasks of: identification of time expressions and events, recognition of their attributes, and estimation of temporal links between recognized events and times. The central feature of the propo...
Journal
Journal title: Chemometrics and Intelligent Laboratory Systems
Year: 2022
ISSN: 1873-3239, 0169-7439
DOI: https://doi.org/10.1016/j.chemolab.2022.104555